Qualcomm AI Engine Direct - Enable per channel linear op#2822
Qualcomm AI Engine Direct - Enable per channel linear op#2822chunit-quic wants to merge 1 commit intopytorch:mainfrom
Conversation
chunit-quic
commented
Apr 3, 2024
- Add per channel weight quantization for linear op
- Bias quantization for per channel weight Linear op is not support yet
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2822
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 2efac57 with merge base 9fd1a0e ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
|
Hey thank you for submitting, do you mind rebasing? Seems like CI was broken and it's fixed in main branch now. |
- Add per channel weight quantization for linear op - Bias quantization for per channel weight Linear op is not support yet
e3c8e0c to
2efac57
Compare
|
Hi @cccclai , since @chunit-quic is on PTO, I help rebase this onto latest mainline. |
|
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |